Overview

Dataset statistics

Number of variables27
Number of observations296
Missing cells415
Missing cells (%)5.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory75.3 KiB
Average record size in memory260.4 B

Variable types

NUM15
BOOL8
CAT3
DATE1

Reproduction

Analysis started2020-05-05 17:15:33.933246
Analysis finished2020-05-05 17:16:17.190420
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
month is highly correlated with quarter and 1 other fieldsHigh Correlation
quarter is highly correlated with month and 1 other fieldsHigh Correlation
weekofyear is highly correlated with quarter and 1 other fieldsHigh Correlation
meanwd_udsprevisionempresa is highly correlated with meanwd_udsventaHigh Correlation
meanwd_udsventa is highly correlated with meanwd_udsprevisionempresaHigh Correlation
udsstock has 98 (33.1%) missing values Missing
udsventa has 63 (21.3%) missing values Missing
udsprevisionempresa has 81 (27.4%) missing values Missing
roll4wd_udsventa has 50 (16.9%) missing values Missing
meanwd_udsventa has 42 (14.2%) missing values Missing
roll4wd_udsstock has 17 (5.7%) missing values Missing
roll4wd_udsprevisionempresa has 64 (21.6%) missing values Missing
weekday has 42 (14.2%) zeros Zeros
sin_weekday has 42 (14.2%) zeros Zeros

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count296
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10650.0
Minimum30
Maximum21270
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum30
5-th percentile1092
Q15340
median10650
Q315960
95-th percentile20208
Maximum21270
Range21240
Interquartile range (IQR)10620

Descriptive statistics

Standard deviation6162.628011
Coefficient of variation (CV)0.5786505174
Kurtosis-1.2
Mean10650
Median Absolute Deviation (MAD)5328
Skewness0
Sum3152400
Variance37977984
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 30. 21270.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
11334 1 0.3%
 
10398 1 0.3%
 
10686 1 0.3%
 
2694 1 0.3%
 
18822 1 0.3%
 
16878 1 0.3%
 
9750 1 0.3%
 
9894 1 0.3%
 
10038 1 0.3%
 
2190 1 0.3%
 
Other values (286) 286 96.6%
 
ValueCountFrequency (%) 
30 1 0.3%
 
102 1 0.3%
 
174 1 0.3%
 
246 1 0.3%
 
318 1 0.3%
 
ValueCountFrequency (%) 
21270 1 0.3%
 
21198 1 0.3%
 
21126 1 0.3%
 
21054 1 0.3%
 
20982 1 0.3%
 

fecha
Date

UNIFORM
UNIQUE
Distinct count296
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum2019-06-05 00:00:00
Maximum2020-03-26 00:00:00
Histogram

producto
Categorical

CONSTANT
REJECTED
Distinct count1
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
42
296
ValueCountFrequency (%) 
42 296 100.0%
 

Length

Max length2
Mean length2
Min length2
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

udsstock
Real number (ℝ≥0)

MISSING
Distinct count107
Unique (%)54.0%
Missing98
Missing (%)33.1%
Infinite0
Infinite (%)0.0%
Mean873.520202020202
Minimum116.0
Maximum2209.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum116
5-th percentile324.65
Q1603.25
median809
Q31092.5
95-th percentile1564.55
Maximum2209
Range2093
Interquartile range (IQR)489.25

Descriptive statistics

Standard deviation391.7146313
Coefficient of variation (CV)0.4484322519
Kurtosis0.6721510347
Mean873.520202
Median Absolute Deviation (MAD)305.5072952
Skewness0.7127143265
Sum172957
Variance153440.3524
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
736 9 3.0%
 
765 7 2.4%
 
649 4 1.4%
 
475 4 1.4%
 
794 4 1.4%
 
610 4 1.4%
 
862 4 1.4%
 
571 4 1.4%
 
1017 4 1.4%
 
1085 4 1.4%
 
Other values (97) 150 50.7%
 
(Missing) 98 33.1%
 
ValueCountFrequency (%) 
116 1 0.3%
 
135 2 0.7%
 
145 1 0.3%
 
174 1 0.3%
 
213 1 0.3%
 
ValueCountFrequency (%) 
2209 1 0.3%
 
1967 3 1.0%
 
1889 1 0.3%
 
1841 1 0.3%
 
1763 2 0.7%
 

udsventa
Real number (ℝ≥0)

MISSING
Distinct count96
Unique (%)41.2%
Missing63
Missing (%)21.3%
Infinite0
Infinite (%)0.0%
Mean538.7467811158798
Minimum88.0
Maximum1830.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum88
5-th percentile252
Q1391
median523
Q3656
95-th percentile853.8
Maximum1830
Range1742
Interquartile range (IQR)265

Descriptive statistics

Standard deviation230.7791531
Coefficient of variation (CV)0.4283629363
Kurtosis8.000659612
Mean538.7467811
Median Absolute Deviation (MAD)164.541583
Skewness1.882204417
Sum125528
Variance53259.0175
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
546 8 2.7%
 
420 6 2.0%
 
487 6 2.0%
 
523 5 1.7%
 
280 5 1.7%
 
605 5 1.7%
 
701 5 1.7%
 
597 5 1.7%
 
479 5 1.7%
 
575 4 1.4%
 
Other values (86) 179 60.5%
 
(Missing) 63 21.3%
 
ValueCountFrequency (%) 
88 1 0.3%
 
154 1 0.3%
 
177 2 0.7%
 
184 1 0.3%
 
199 1 0.3%
 
ValueCountFrequency (%) 
1830 1 0.3%
 
1815 1 0.3%
 
1424 1 0.3%
 
1298 1 0.3%
 
1107 1 0.3%
 

udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count202
Unique (%)94.0%
Missing81
Missing (%)27.4%
Infinite0
Infinite (%)0.0%
Mean2763.753488372093
Minimum85.0
Maximum15421.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum85
5-th percentile355.3
Q11224.5
median2280
Q33680
95-th percentile6412.6
Maximum15421
Range15336
Interquartile range (IQR)2455.5

Descriptive statistics

Standard deviation2253.619448
Coefficient of variation (CV)0.8154198475
Kurtosis7.605839069
Mean2763.753488
Median Absolute Deviation (MAD)1615.708772
Skewness2.119297
Sum594207
Variance5078800.617
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3285 3 1.0%
 
960 2 0.7%
 
88 2 0.7%
 
4752 2 0.7%
 
369 2 0.7%
 
206 2 0.7%
 
1160 2 0.7%
 
5502 2 0.7%
 
1974 2 0.7%
 
2386 2 0.7%
 
Other values (192) 194 65.5%
 
(Missing) 81 27.4%
 
ValueCountFrequency (%) 
85 1 0.3%
 
88 2 0.7%
 
134 1 0.3%
 
147 1 0.3%
 
172 1 0.3%
 
ValueCountFrequency (%) 
15421 1 0.3%
 
13811 1 0.3%
 
12574 1 0.3%
 
8348 1 0.3%
 
8230 1 0.3%
 

promo
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
280
1
 
16
ValueCountFrequency (%) 
0 280 94.6%
 
1 16 5.4%
 

festivo
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
288
1
 
8
ValueCountFrequency (%) 
0 288 97.3%
 
1 8 2.7%
 

weekday
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9966216216216215
Minimum0
Maximum6
Zeros42
Zeros (%)14.2%
Memory size2.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.997453142
Coefficient of variation (CV)0.6665683542
Kurtosis-1.241520413
Mean2.996621622
Median Absolute Deviation (MAD)1.706560446
Skewness0.004680305814
Sum887
Variance3.989819056
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 43 14.5%
 
2 43 14.5%
 
6 42 14.2%
 
5 42 14.2%
 
4 42 14.2%
 
1 42 14.2%
 
0 42 14.2%
 
ValueCountFrequency (%) 
0 42 14.2%
 
1 42 14.2%
 
2 43 14.5%
 
3 43 14.5%
 
4 42 14.2%
 
ValueCountFrequency (%) 
6 42 14.2%
 
5 42 14.2%
 
4 42 14.2%
 
3 43 14.5%
 
2 43 14.5%
 

quarter
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
4
92
3
92
1
86
2
26
ValueCountFrequency (%) 
4 92 31.1%
 
3 92 31.1%
 
1 86 29.1%
 
2 26 8.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

month
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count10
Unique (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.993243243243243
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.667533456
Coefficient of variation (CV)0.5244395666
Kurtosis-1.215710455
Mean6.993243243
Median Absolute Deviation (MAD)3.109751644
Skewness-0.3478227975
Sum2070
Variance13.45080165
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 6.5 11.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12 31 10.5%
 
10 31 10.5%
 
8 31 10.5%
 
7 31 10.5%
 
1 31 10.5%
 
11 30 10.1%
 
9 30 10.1%
 
2 29 9.8%
 
6 26 8.8%
 
3 26 8.8%
 
ValueCountFrequency (%) 
1 31 10.5%
 
2 29 9.8%
 
3 26 8.8%
 
6 26 8.8%
 
7 31 10.5%
 
ValueCountFrequency (%) 
12 31 10.5%
 
11 30 10.1%
 
10 31 10.5%
 
9 30 10.1%
 
8 31 10.5%
 

weekofyear
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count43
Unique (%)14.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.469594594594593
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum1
5-th percentile3
Q111
median31
Q342
95-th percentile50
Maximum52
Range51
Interquartile range (IQR)31

Descriptive statistics

Standard deviation15.97664889
Coefficient of variation (CV)0.561182873
Kurtosis-1.229228509
Mean28.46959459
Median Absolute Deviation (MAD)13.65613587
Skewness-0.3266565044
Sum8427
Variance255.2533097
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 12.5 23.5 52. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
52 7 2.4%
 
51 7 2.4%
 
29 7 2.4%
 
28 7 2.4%
 
27 7 2.4%
 
26 7 2.4%
 
25 7 2.4%
 
24 7 2.4%
 
12 7 2.4%
 
11 7 2.4%
 
Other values (33) 226 76.4%
 
ValueCountFrequency (%) 
1 7 2.4%
 
2 7 2.4%
 
3 7 2.4%
 
4 7 2.4%
 
5 7 2.4%
 
ValueCountFrequency (%) 
52 7 2.4%
 
51 7 2.4%
 
50 7 2.4%
 
49 7 2.4%
 
48 7 2.4%
 
Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size424.0 B
True
246
False
50
ValueCountFrequency (%) 
True 246 83.1%
 
False 50 16.9%
 

sin_weekday
Real number (ℝ)

ZEROS
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.004759498821957385
Minimum-0.9749279121818236
Maximum0.9749279121818236
Zeros42
Zeros (%)14.2%
Memory size2.4 KiB

Quantile statistics

Minimum-0.9749279122
5-th percentile-0.9749279122
Q1-0.7818314825
median0
Q30.7818314825
95-th percentile0.9749279122
Maximum0.9749279122
Range1.949855824
Interquartile range (IQR)1.563662965

Descriptive statistics

Standard deviation0.7086201304
Coefficient of variation (CV)148.8854514
Kurtosis-1.50521649
Mean0.004759498822
Median Absolute Deviation (MAD)0.6270716718
Skewness-0.0106157593
Sum1.408811651
Variance0.5021424891
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.97492791 -0.8783797 0.8783797 0.97492791], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.4338837391 43 14.5%
 
0.9749279122 43 14.5%
 
-0.4338837391 42 14.2%
 
-0.9749279122 42 14.2%
 
-0.7818314825 42 14.2%
 
0.7818314825 42 14.2%
 
0 42 14.2%
 
ValueCountFrequency (%) 
-0.9749279122 42 14.2%
 
-0.7818314825 42 14.2%
 
-0.4338837391 42 14.2%
 
0 42 14.2%
 
0.4338837391 43 14.5%
 
ValueCountFrequency (%) 
0.9749279122 43 14.5%
 
0.7818314825 42 14.2%
 
0.4338837391 43 14.5%
 
0 42 14.2%
 
-0.4338837391 42 14.2%
 

cos_weekday
Real number (ℝ)

Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.0037955736549281846
Minimum-0.9009688679024191
Maximum1.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum-0.9009688679
5-th percentile-0.9009688679
Q1-0.9009688679
median-0.222520934
Q30.6234898019
95-th percentile1
Maximum1
Range1.900968868
Interquartile range (IQR)1.52445867

Descriptive statistics

Standard deviation0.7079619739
Coefficient of variation (CV)-186.5230498
Kurtosis-1.503349059
Mean-0.003795573655
Median Absolute Deviation (MAD)0.6408877408
Skewness0.009053080122
Sum-1.123489802
Variance0.5012101565
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.90096887 -0.90096887 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-0.222520934 43 14.5%
 
-0.9009688679 43 14.5%
 
-0.222520934 42 14.2%
 
-0.9009688679 42 14.2%
 
0.6234898019 42 14.2%
 
1 42 14.2%
 
0.6234898019 42 14.2%
 
ValueCountFrequency (%) 
-0.9009688679 42 14.2%
 
-0.9009688679 43 14.5%
 
-0.222520934 42 14.2%
 
-0.222520934 43 14.5%
 
0.6234898019 42 14.2%
 
ValueCountFrequency (%) 
1 42 14.2%
 
0.6234898019 42 14.2%
 
0.6234898019 42 14.2%
 
-0.222520934 43 14.5%
 
-0.222520934 42 14.2%
 

is_august
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
265
1
 
31
ValueCountFrequency (%) 
0 265 89.5%
 
1 31 10.5%
 

spring
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
291
1
 
5
ValueCountFrequency (%) 
0 291 98.3%
 
1 5 1.7%
 

summer
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
188
1
108
ValueCountFrequency (%) 
0 188 63.5%
 
1 108 36.5%
 

autumn
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
206
1
90
ValueCountFrequency (%) 
0 206 69.6%
 
1 90 30.4%
 

winter
Boolean

Distinct count2
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
200
1
96
ValueCountFrequency (%) 
0 200 67.6%
 
1 96 32.4%
 

stockMissingType
Categorical

Distinct count3
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
0
198
2
87
1
 
11
ValueCountFrequency (%) 
0 198 66.9%
 
2 87 29.4%
 
1 11 3.7%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Decimal_Number 3 75.0%
 
Other_Punctuation 1 25.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

roll4wd_udsventa
Real number (ℝ≥0)

MISSING
Distinct count239
Unique (%)97.2%
Missing50
Missing (%)16.9%
Infinite0
Infinite (%)0.0%
Mean537.2568960511034
Minimum268.625
Maximum1387.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum268.625
5-th percentile314.28125
Q1411.5535714
median535.7589286
Q3623.3125
95-th percentile766.71875
Maximum1387
Range1118.375
Interquartile range (IQR)211.7589286

Descriptive statistics

Standard deviation163.5755164
Coefficient of variation (CV)0.3044642472
Kurtosis3.514143979
Mean537.2568961
Median Absolute Deviation (MAD)123.2683072
Skewness1.154097903
Sum132165.1964
Variance26756.94956
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
414.5 2 0.7%
 
583.5 2 0.7%
 
555 2 0.7%
 
608.375 2 0.7%
 
682.125 2 0.7%
 
491.25 2 0.7%
 
347.375 2 0.7%
 
329.75 1 0.3%
 
462.5 1 0.3%
 
554.7142857 1 0.3%
 
Other values (229) 229 77.4%
 
(Missing) 50 16.9%
 
ValueCountFrequency (%) 
268.625 1 0.3%
 
269.125 1 0.3%
 
276.5 1 0.3%
 
282.625 1 0.3%
 
287 1 0.3%
 
ValueCountFrequency (%) 
1387 1 0.3%
 
1225.714286 1 0.3%
 
1026.428571 1 0.3%
 
1023.625 1 0.3%
 
924 1 0.3%
 

meanwd_udsventa
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
Distinct count6
Unique (%)2.4%
Missing42
Missing (%)14.2%
Infinite0
Infinite (%)0.0%
Mean539.0817874183757
Minimum425.43243243243245
Maximum721.421052631579
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum425.4324324
5-th percentile425.4324324
Q1427.575
median546.7105263
Q3627.075
95-th percentile721.4210526
Maximum721.4210526
Range295.9886202
Interquartile range (IQR)199.5

Descriptive statistics

Standard deviation107.504357
Coefficient of variation (CV)0.1994212372
Kurtosis-1.13015028
Mean539.0817874
Median Absolute Deviation (MAD)92.67711065
Skewness0.5035262163
Sum136926.774
Variance11557.18677
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
627.075 43 14.5%
 
546.7105263 43 14.5%
 
427.575 42 14.2%
 
721.4210526 42 14.2%
 
425.4324324 42 14.2%
 
484 42 14.2%
 
(Missing) 42 14.2%
 
ValueCountFrequency (%) 
425.4324324 42 14.2%
 
427.575 42 14.2%
 
484 42 14.2%
 
546.7105263 43 14.5%
 
627.075 43 14.5%
 
ValueCountFrequency (%) 
721.4210526 42 14.2%
 
627.075 43 14.5%
 
546.7105263 43 14.5%
 
484 42 14.2%
 
427.575 42 14.2%
 

roll4wd_udsstock
Real number (ℝ≥0)

MISSING
Distinct count239
Unique (%)85.7%
Missing17
Missing (%)5.7%
Infinite0
Infinite (%)0.0%
Mean854.271556579621
Minimum234.75
Maximum2209.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum234.75
5-th percentile407
Q1643.1071429
median815.8
Q31039.571429
95-th percentile1364.7
Maximum2209
Range1974.25
Interquartile range (IQR)396.4642857

Descriptive statistics

Standard deviation311.2570991
Coefficient of variation (CV)0.3643538131
Kurtosis1.378515585
Mean854.2715566
Median Absolute Deviation (MAD)242.720032
Skewness0.8119969193
Sum238341.7643
Variance96880.98173
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
407 6 2.0%
 
504 5 1.7%
 
765 4 1.4%
 
1085 3 1.0%
 
590.25 2 0.7%
 
790.4 2 0.7%
 
416 2 0.7%
 
769.4285714 2 0.7%
 
1240 2 0.7%
 
678 2 0.7%
 
Other values (229) 249 84.1%
 
(Missing) 17 5.7%
 
ValueCountFrequency (%) 
234.75 1 0.3%
 
283.25 1 0.3%
 
324.5 1 0.3%
 
329 1 0.3%
 
339 1 0.3%
 
ValueCountFrequency (%) 
2209 1 0.3%
 
2029.75 1 0.3%
 
1850.5 1 0.3%
 
1676 1 0.3%
 
1671.25 1 0.3%
 

meanwd_udsstock
Real number (ℝ≥0)

Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean866.7360099309544
Minimum634.952380952381
Maximum1121.2222222222222
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum634.952381
5-th percentile634.952381
Q1686.04
median836.6
Q31026.206897
95-th percentile1121.222222
Maximum1121.222222
Range486.2698413
Interquartile range (IQR)340.1668966

Descriptive statistics

Standard deviation171.2294205
Coefficient of variation (CV)0.1975566015
Kurtosis-1.473227951
Mean866.7360099
Median Absolute Deviation (MAD)155.459912
Skewness0.09458312511
Sum256553.8589
Variance29319.51444
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 634.95238095 660.49619048 1011.5 1121.22222222], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1026.206897 43 14.5%
 
836.6 43 14.5%
 
1121.222222 42 14.2%
 
634.952381 42 14.2%
 
762.2580645 42 14.2%
 
996.7931034 42 14.2%
 
686.04 42 14.2%
 
ValueCountFrequency (%) 
634.952381 42 14.2%
 
686.04 42 14.2%
 
762.2580645 42 14.2%
 
836.6 43 14.5%
 
996.7931034 42 14.2%
 
ValueCountFrequency (%) 
1121.222222 42 14.2%
 
1026.206897 43 14.5%
 
996.7931034 42 14.2%
 
836.6 43 14.5%
 
762.2580645 42 14.2%
 

roll4wd_udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count229
Unique (%)98.7%
Missing64
Missing (%)21.6%
Infinite0
Infinite (%)0.0%
Mean2830.7518626847295
Minimum85.0
Maximum15421.0
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum85
5-th percentile211.0142857
Q11309.736607
median2308.1875
Q33775.2625
95-th percentile6756.7625
Maximum15421
Range15336
Interquartile range (IQR)2465.525893

Descriptive statistics

Standard deviation2357.303113
Coefficient of variation (CV)0.8327480569
Kurtosis7.09494101
Mean2830.751863
Median Absolute Deviation (MAD)1646.196307
Skewness2.18795865
Sum656734.4321
Variance5556877.968
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
88 2 0.7%
 
147 2 0.7%
 
2579.75 2 0.7%
 
2828 1 0.3%
 
2031.8 1 0.3%
 
2380.125 1 0.3%
 
1892.25 1 0.3%
 
2877 1 0.3%
 
2322.625 1 0.3%
 
1991.375 1 0.3%
 
Other values (219) 219 74.0%
 
(Missing) 64 21.6%
 
ValueCountFrequency (%) 
85 1 0.3%
 
88 2 0.7%
 
104.8571429 1 0.3%
 
147 2 0.7%
 
161.75 1 0.3%
 
ValueCountFrequency (%) 
15421 1 0.3%
 
13811 1 0.3%
 
12574 1 0.3%
 
12204.25 1 0.3%
 
11310.5 1 0.3%
 

meanwd_udsprevisionempresa
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count7
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2267.130010577379
Minimum147.0
Maximum4634.894736842105
Zeros0
Zeros (%)0.0%
Memory size2.4 KiB

Quantile statistics

Minimum147
5-th percentile147
Q11069.916667
median2023.605263
Q33600.410256
95-th percentile4634.894737
Maximum4634.894737
Range4487.894737
Interquartile range (IQR)2530.49359

Descriptive statistics

Standard deviation1405.572389
Coefficient of variation (CV)0.6199787318
Kurtosis-0.9070440848
Mean2267.130011
Median Absolute Deviation (MAD)1169.726349
Skewness0.2230081453
Sum671070.4831
Variance1975633.74
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 147. 2335.39473684 4634.89473684], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2647.184211 43 14.5%
 
3600.410256 43 14.5%
 
2023.605263 42 14.2%
 
1069.916667 42 14.2%
 
1706.105263 42 14.2%
 
4634.894737 42 14.2%
 
147 42 14.2%
 
ValueCountFrequency (%) 
147 42 14.2%
 
1069.916667 42 14.2%
 
1706.105263 42 14.2%
 
2023.605263 42 14.2%
 
2647.184211 43 14.5%
 
ValueCountFrequency (%) 
4634.894737 42 14.2%
 
3600.410256 43 14.5%
 
2647.184211 43 14.5%
 
2023.605263 42 14.2%
 
1706.105263 42 14.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

df_indexfechaproductoudsstockudsventaudsprevisionempresapromofestivoweekdayquartermonthweekofyearworking_daysin_weekdaycos_weekdayis_augustspringsummerautumnwinterstockMissingTyperoll4wd_udsventameanwd_udsventaroll4wd_udsstockmeanwd_udsstockroll4wd_udsprevisionempresameanwd_udsprevisionempresa
0302019-06-0542765.0523.012574.00.00.022623True0.974928-0.222521001000.0523.00546.710526765.00836.60000012574.002647.184211
11022019-06-0642NaN752.015421.00.00.032623True0.433884-0.900969001002.0752.00627.075000NaN1026.20689715421.003600.410256
21742019-06-0742NaN383.013811.00.00.042623True-0.433884-0.900969001002.0383.00721.421053NaN996.79310313811.004634.894737
32462019-06-0842NaN553.05558.00.00.052623True-0.974928-0.222521001002.0553.00425.432432NaN1121.2222225558.001069.916667
43182019-06-0942NaNNaNNaN0.00.062623False-0.7818310.623490001002.0NaNNaNNaN634.952381NaN147.000000
53902019-06-1042NaN420.04591.00.00.002624True0.0000001.000000001002.0420.00484.000000NaN686.0400004591.002023.605263
64622019-06-1142649.0287.03404.00.00.012624True0.7818310.623490001000.0287.00427.575000649.00762.2580653404.001706.105263
75342019-06-12421298.0560.01797.00.00.022624True0.974928-0.222521001000.0532.25546.710526898.25836.6000009879.752647.184211
86062019-06-13421153.0686.02554.00.00.032624True0.433884-0.900969001000.0735.50627.0750001153.001026.20689712204.253600.410256
96782019-06-14421037.01830.03809.01.00.042624True-0.433884-0.900969001000.0744.75721.4210531037.00996.79310311310.504634.894737

Last rows

df_indexfechaproductoudsstockudsventaudsprevisionempresapromofestivoweekdayquartermonthweekofyearworking_daysin_weekdaycos_weekdayis_augustspringsummerautumnwinterstockMissingTyperoll4wd_udsventameanwd_udsventaroll4wd_udsstockmeanwd_udsstockroll4wd_udsprevisionempresameanwd_udsprevisionempresa
286206222020-03-1742NaN1107.01254.00.00.011312True0.7818310.623490000012.0395.375000427.575000819.285714762.2580652155.5001706.105263
287206942020-03-1842NaN501.02825.00.00.021312True0.974928-0.222521000012.0576.750000546.7105261055.500000836.6000003514.6252647.184211
288207662020-03-1942NaN405.02940.00.00.031312True0.433884-0.900969000012.0468.000000627.075000678.0000001026.2068974432.0003600.410256
289208382020-03-20421172.0NaN3062.00.00.041312True-0.433884-0.900969000010.0908.142857721.421053871.500000996.7931036285.3754634.894737
290209102020-03-21421967.0NaNNaN0.00.051312True-0.974928-0.222521000010.0896.714286425.432432976.4000001121.222222NaN1069.916667
291209822020-03-22421967.0NaNNaN0.00.061312False-0.7818310.623490010010.0NaNNaN1240.000000634.952381NaN147.000000
292210542020-03-23421967.0NaN1160.00.00.001313True0.0000001.000000010010.0338.000000484.0000001240.000000686.0400002034.5002023.605263
293211262020-03-24421498.0NaN329.00.00.011313True0.7818310.623490010010.0644.857143427.5750001057.200000762.2580651618.5001706.105263
294211982020-03-25421498.0NaN1673.00.00.021313True0.974928-0.222521010010.0546.428571546.7105261378.500000836.6000003054.8752647.184211
295212702020-03-26421498.0NaN673.00.00.031313True0.433884-0.900969010010.0378.857143627.0750001498.0000001026.2068973388.6253600.410256